Berno, Anthony J. 

Application/Control Number: 10/768,788 
Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of Claims: 

1. (currently amended) A computer-implemented method for characterizing an
interrogation position in nucleic acid segments collected from a case group and a control
group, wherein the interrogation position contains a biallelic polymorphism, comprising:

inputting into a computer system a first measure of relative allele frequency at the
interrogation position in the nucleic acid segment derived from a first sample collected from
the case group of n individuals, wherein n is an integer equal to or larger than 2, and
wherein individuals in the case group are selected based on a phenotypic characteristic of
interest;

inputting into the computer system a second measure of relative allele frequency at the
interrogation position in the nucleic acid segment derived from a second sample collected
from the control group of m individuals, wherein m is an integer equal to or larger
than 2, and wherein individuals in the control group are selected based on lack of the
phenotypic characteristic of interest;

analyzing in the computer system the first measure and the second measure to
characterize the interrogation position; and

outputting a result of said analyzing, wherein the result characterizes the interrogation
position as being associated with the phenotypic characteristic of interest.

2. (canceled) 

3. (currently amended) The method of claim 1, wherein the individuals in the case
and control groups are animals.

4. (currently amended) The method of claim 1, wherein the individuals in the case
and control groups are mammals.

5. (currently amended) The method of claim 1, wherein the individuals in the case
and control groups are humans.

6. (canceled) 

7. (currently amended) The method of claim 1, wherein the phenotypic characteristic of
interest is susceptibility or resistance to a disease, a disorder, or infection of a pathogen.

8. (currently amended) The method of claim 1, wherein the phenotypic characteristic of
interest is susceptibility or resistance to effects of uptake of food or drink.

9. (original) The method of claim 8, wherein the drink is an alcoholic drink.

10. (currently amended) The method of claim 1, wherein the phenotypic characteristic of
interest is susceptibility to, resistance to, or adverse effects of a therapy using a therapeutic
agent or a medical device.

11. (currently amended) The method of claim 1, wherein the phenotypic characteristic of
interest is selected from the group consisting of cancer, hematological disorders, autoimmune 
diseases, inflammatory diseases, cardiovascular diseases, liver diseases, neurodegenerative 
diseases, diabetes, kidney disorder, gastrointestinal disorders, pain, bacterial infection, 
parasitic infection, viral infection, and a specific stage of development thereof. 

12. (currently amended) The method of claim 1, wherein the number of individuals in the
case group, n, and the number of the individuals in the control group, m, are each larger than
5.

13. (currently amended) The method of claim 1, wherein the number of individuals in the
case group, n, and the number of the individuals in the control group, m, are each larger than
100.

14. (currently amended) The method of claim 1, wherein the number of individuals in the
case group, n, and the number of the individuals in the control group, m, each independently 
varies between lOfMl and 100,000. 

15. (currently amended) The method of claim 1, wherein the interrogation position is a
SNP position.

16. (currently amended) The method of claim 15, wherein a genetic region containing
the SNP position was not previously known to be associated with the phenotypic 
characteristic of interest. 

17. (currently amended) The method of claim 1, wherein the first and second samples are
each collected by pooling biological samples from the case group and the control
group of individuals, respectively.

18. (original) The method of claim 17, wherein the biological samples are pooled by 
combining a substantially equal amount of biological sample from each individual in each 
group. 

19. (original) The method of claim 17, wherein the biological samples are selected from 
the group consisting of genomic DNA, mitochondrial DNA, extragenomic DNA, cDNA, and 
RNA. 

20. (currently amended) The method of claim 17, wherein the biological samples from the
case group and the control groups are labeled with a detectable marker.

21. (original) The method of claim 20, wherein the detectable marker is selected from the 
group consisting of cychrome, fluorescein, Alexa-488, radioisotopes, and biotin.

22. (currently amended) The method of claim 1, wherein the first sample is collected by
pooling genetic material from the case group of individuals and amplifying the pooled
genetic materials from the case group and the second sample is collected by pooling
genetic material from the control group of individuals and amplifying the pooled
genetic materials from the control group.

23. (currently amended) The method of claim 22, wherein amplicons of genetic material
from the case group are labeled with a different detectable marker than are amplicons
of genetic material from the control group.

24. (currently amended) The method of claim 23, wherein amplicons of genetic materials
from the case group are each labeled with biotin; and amplicons of genetic materials from
the control group are each labeled with fluorescein.

25. (original) The method of claim 23, wherein the first sample and second sample are 
mixed after being labeled, followed by hybridization to an array and subsequent staining, 
wherein said first sample is stained with different stain than said second sample. 

26. (original) The method of claim 1, wherein the first sample is collected by amplifying 
genetic material from each individual in the case group of individuals and then pooling
the amplified genetic materials from all of the individuals in the case group, and the
second sample is collected by amplifying genetic material from each individual in the
control group of individuals and then pooling the amplified genetic materials from all of the
individuals in the control group.

27. (canceled) 

28. (original) The method of claim 1, wherein the first measure of relative allele frequency 
is derived from a first measure of intensity of signals from a first probe set on a first 
oligonucleotide array; and the second measure of relative allele frequency is derived from a
second measure of intensity of signals from a second probe set on the first or a second
oligonucleotide array. 

29. (currently amended) The method of claim 28, wherein the density of the
oligonucleotide array is at least 100 probes per square centimeter.

30. (currently amended) The method of claim 28, wherein the density of the
oligonucleotide array is at least 1000 probes per square centimeter.

31. (currently amended) The method of claim 28, wherein the density of the
oligonucleotide array is between 100-100,000,000 probes per square centimeter.

32. (currently amended) The method of claim 28, wherein the density of the
oligonucleotide array is between 1,000,000-80,000,000 probes per square centimeter.

33. (original) The method of claim 28, wherein each of the oligonucleotides on the array is 
10-100 nucleotides in length. 

34. (original) The method of claim 28, wherein each of the oligonucleotides on the array is 
20-50 nucleotides in length. 

35. (original) The method of claim 28, further comprising: 
correcting the first and second measures of intensity. 

36. (original) The method of claim 35, wherein the step of correcting includes subtracting 
a background intensity. 

37. (original) The method of claim 36, wherein the background intensity is the intensity of 
a probe cell having the 1000th lowest intensity on the oligonucleotide array.

38. (original) The method of claim 36, wherein the background intensity is determined by
calculating an equation of the form:

(<IT> + <IT>) / 2
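
As a hedged illustration of the background correction in claims 36 and 37, the subtraction of the 1000th lowest probe-cell intensity can be sketched as follows. The function name `subtract_background`, the `rank` parameter, and the clamping of negative results to zero are assumptions made for this sketch and are not recited in the claims.

```python
def subtract_background(intensities, rank=1000):
    """Subtract the rank-th lowest intensity from every probe-cell intensity.

    `rank=1000` mirrors claim 37; clamping at zero is an assumption of this
    sketch, since the claims do not say how negative values are handled.
    """
    if rank < 1 or len(intensities) < rank:
        raise ValueError("array has fewer probe cells than the requested rank")
    # The background estimate is the rank-th smallest value (1-based rank).
    background = sorted(intensities)[rank - 1]
    return [max(value - background, 0.0) for value in intensities]
```

A small array can be checked by hand by lowering `rank`, for example `subtract_background([5.0, 1.0, 3.0], rank=2)`.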

39. (original) The method of claim 28, further comprising: 

evaluating the first and second measures of intensity to determine whether the first 
probe set and the second probe set have each detected the nucleotide sequences that they were 
designed to detect. 

40. (original) The method of claim 39, wherein the step of evaluating includes 
determining if the measure of intensity of a perfectly complementary probe in the first probe 
set is greater than any of the measures of intensity of mismatch probes in the first probe set, in 
which case the first probe set is a conforming probe set; and determining if the measure of 
intensity of a perfectly complementary probe in the second probe set is greater than any of 
the measures of intensity of mismatch probes in the second probe set, in which case the 
second probe set is a conforming probe set.

41. (original) The method of claim 28, wherein the first or second probe set is included in 
a probe tiling that comprises 

a reference tiling comprising a set of reference oligonucleotide probes with a varying 
nucleotide at the interrogation position and a set of complementary reference oligonucleotide 
probes that are each complementary to the corresponding reference oligonucleotide probes; 
and 

an alternate tiling comprising a set of alternate oligonucleotide probes with a varying 
nucleotide at the interrogation position and a set of complementary alternate oligonucleotide 
probes that are each complementary to the corresponding alternate oligonucleotide probes. 

42. (original) The method of claim 41, wherein the reference and alternate oligonucleotide 
probes differ from each other by a nucleotide at the interrogation position. 




43. (original) The method of claim 41, wherein the varying nucleotide at the interrogation 
position is A, T, G, U or C. 

44. (original) The method of claim 43, further comprising: 
calculating a number of conforming probe sets for the reference tiling; 
calculating a total number of probe sets in the reference tiling; 

calculating a conformance value for the reference tiling by dividing the number of 
conforming probe sets with the total number of probe sets in the reference tiling; 

calculating a number of conforming probe sets for the alternate tiling; 

calculating a total number of probe sets in the alternate tiling; and 

calculating a conformance value for the alternate tiling by dividing the number of 
conforming probe sets with the total number of probe sets in the alternate tiling. 

45. (original) The method of claim 44, further comprising: 

discarding any measure of intensity obtained from the tiling that has the conformance 
value lower than 0.6. 

46. (original) The method of claim 44, further comprising: 

discarding any measure of intensity obtained from the tiling that has the conformance 
value lower than 0.9. 
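
The conformance test of claim 40 and the conformance value of claims 44 to 46 can be sketched in Python as below. This is a minimal sketch under stated assumptions: each probe set is represented as a hypothetical `(perfect_match, mismatches)` pair, and the names `is_conforming`, `conformance_value`, and `keep_tiling` are illustrative only.

```python
def is_conforming(perfect_match, mismatches):
    """Claim 40: the perfect-match intensity exceeds every mismatch intensity."""
    return all(perfect_match > mismatch for mismatch in mismatches)


def conformance_value(probe_sets):
    """Claim 44: conforming probe sets divided by total probe sets in a tiling.

    `probe_sets` is a list of (perfect_match_intensity, [mismatch_intensities]).
    """
    if not probe_sets:
        raise ValueError("a tiling needs at least one probe set")
    conforming = sum(1 for pm, mm in probe_sets if is_conforming(pm, mm))
    return conforming / len(probe_sets)


def keep_tiling(probe_sets, cutoff=0.6):
    """Claims 45 and 46: discard a tiling whose conformance is lower than the cutoff."""
    return conformance_value(probe_sets) >= cutoff
```

Passing `cutoff=0.9` gives the stricter filter of claim 46.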

47. (original) The method of claim 28, wherein said first probe set and said second probe 
set are the same probe set. 

48. (currently amended) The method of claim 1, wherein the relative allele frequency is 
based on variation of a nucleotide at the interrogation position of the nucleic acid segments 
relative to a reference nucleic acid segment, the nucleotide at said interrogation position in 
said reference nucleic acid segment being designated a reference allele and a nucleotide at 
said interrogation position that differs from said reference allele being designated an alternate 
allele; and the measure of relative allele frequency is the proportion of either said reference 
allele or said alternate allele at the interrogation position.

49. (original) The method of claim 48, wherein the relative allele frequency is determined
by calculating an equation of the form: 

c_r/(c_a + c_r) or
c_a/(c_a + c_r),

where c_r is the concentration of the reference allele and c_a is the concentration of the alternate
allele.

50. (withdrawn) The method of claim 48, wherein the measure of relative allele frequency 
is determined by calculating an equation of the form: 

I_R/(I_A + I_R) or
I_A/(I_A + I_R),

where I_R is an intensity of signal from the reference allele, and I_A is an intensity of signal from
the alternate allele.

51. (withdrawn) The method of claim 48, wherein the measure of relative allele frequency 
is determined using measurements of hybridization to perfect match probes. 

52. (original) The method of claim 48, wherein each measure of relative allele frequency
is determined using at least two intensity of signal measurements by calculating at least one of
the equations of the form:

<I_R>/(<I_A> + <I_R>) and
<I_A>/(<I_A> + <I_R>),

where <I_R> is an average of intensities of signal measurements from the reference allele, and
<I_A> is an average of intensities of signal measurements from the alternate allele.
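
The ratio of averaged intensities in claim 52 can be sketched in Python as below, here using the trimmed mean that claim 54 names. The function names, the `trim_fraction` default of 0.2, and the rule of trimming an equal number of values from each end are assumptions of this sketch; the claims do not specify how much is trimmed.

```python
def trimmed_mean(values, trim_fraction=0.2):
    """Mean after dropping the same number of lowest and highest values.

    `trim_fraction` is the share removed from EACH end (an assumption).
    """
    if not values:
        raise ValueError("need at least one intensity")
    if not 0 <= trim_fraction < 0.5:
        raise ValueError("trim_fraction must be in [0, 0.5)")
    ordered = sorted(values)
    k = int(len(ordered) * trim_fraction)
    kept = ordered[k:len(ordered) - k]
    return sum(kept) / len(kept)


def relative_allele_frequency(ref_intensities, alt_intensities, trim_fraction=0.2):
    """Return (<I_R>/(<I_A>+<I_R>), <I_A>/(<I_A>+<I_R>)) as in claim 52."""
    mean_ref = trimmed_mean(ref_intensities, trim_fraction)
    mean_alt = trimmed_mean(alt_intensities, trim_fraction)
    total = mean_ref + mean_alt
    if total == 0:
        raise ValueError("both averaged intensities are zero")
    return mean_ref / total, mean_alt / total
```

Setting `trim_fraction=0` reduces the trimmed mean to the arithmetic mean.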

53. (withdrawn) The method of claim 52, wherein the averages of intensities of signal 
measurements are arithmetic means of intensities of signal measurements. 

54. (original) The method of claim 52, wherein the averages of intensities of signal 
measurements are trimmed means of intensities of signal measurements.

55. (withdrawn) The method of claim 48, wherein each measure of relative allele
frequency is determined using at least two intensity of signal measurements by calculating at
least one of the equations of the form:

<I_R/(I_A + I_R)> = [Σ I_R/(I_A + I_R)] / n and
<I_A/(I_A + I_R)> = [Σ I_A/(I_A + I_R)] / n,

where I_R is an intensity of signal measurement from the reference allele, I_A is an intensity of
signal measurement from the alternate allele, <I_R/(I_A + I_R)> and <I_A/(I_A + I_R)> are averages of
the ratios of intensities of signal measurements, and n is a number of offsets at which the
intensity of signal measurements were measured.

56. (withdrawn) The method of claim 55, wherein the averages of the ratios of intensities 
of signal measurements are arithmetic means of the ratios of intensities of signal 
measurements. 

57. (withdrawn) The method of claim 55, wherein the averages of the ratios of intensities of signal
measurements are trimmed means of the ratios of intensities of signal measurements. 

58. (withdrawn) The method of claim 48, wherein each measure of relative allele
frequency is determined using at least one intensity of signal measurement that has been 
corrected. 

59. (withdrawn) The method of claim 58, wherein the at least one intensity of signal 
measurement is corrected by subtracting background.

60. (withdrawn) The method of claim 59, wherein the background is calculated at least
from intensity of signal measurements from mismatch probes. 

61. (withdrawn) The method of claim 60, wherein the background is calculated from a 
mean of the intensity of signal measurements from mismatch probes. 

62. (withdrawn) The method of claim 61, wherein the mean of intensities of signal 
measurements from mismatch probes is an arithmetic mean. 

63. (withdrawn) The method of claim 61, wherein the mean of intensities of signal 
measurements from mismatch probes is a trimmed mean. 

64. (original) The method of claim 1, wherein the step of analyzing the first measure and 
the second measure includes determining a difference between the first measure and the 
second measure. 

65. (original) The method of claim 64, further comprising: 

determining if the difference falls within a predetermined percentile of a distribution 
of differences between first measures and second measures. 

66. (original) The method of claim 65, wherein the percentile of distribution is top 20%. 

67. (original) The method of claim 65, wherein the percentile of distribution is top 5%. 

68. (currently amended) The method of claim 1, further comprising: 

inputting into the computer system another first measure of relative allele
frequency that is obtained by repeating the measurement of the first measure of relative allele
frequency; and

inputting into the computer system another second measure of relative allele
frequency that is obtained by repeating the measurement of the second measure of relative
allele frequency, wherein the step of analyzing includes determining, based on the variation in
the first measure and said another first measure and the variation in the second
measure and said another second measure, whether the first, another first,
second and another second measures of relative allele frequency are suitable for use in
characterizing the interrogation position.

69. (currently amended) The method of claim 1, wherein the interrogation position is a
SNP position at position i within a haplotype block having N different haplotype patterns, and
each of the first and second measures of relative allele frequency is corrected by
applying a first equation:

P_i = Σ_{j=1}^{N} m_ij f_j,

where P_i is a corrected relative allele frequency of the interrogation position; N is the total
number of different haplotype patterns within the haplotype block; m_ij is a coefficient having
a value of +1 if the allele at position i in haplotype pattern j matches a reference allele of the
SNP and having a value of 0 if the allele at position i matches an alternate allele of the SNP;
and f_j is a haplotype pattern frequency.

70. (currently amended) The method of claim 69, wherein the first equation is constrained
by a second equation:

Σ_{j=1}^{N} f_j = 1.

71. (original) The method of claim 70, wherein the f_j has a value ranging from 0 to 1.
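
As a hedged sketch of the haplotype-based correction in claims 69 to 71, the code below evaluates a corrected allele frequency for one SNP position. It assumes the first equation has the linear form P_i = Σ_j m_ij f_j, which is inferred from the symbol definitions in claim 69 rather than read from a legible equation, and it enforces the constraints that the haplotype pattern frequencies sum to 1 and each lie between 0 and 1. The function name and argument layout are hypothetical.

```python
import math


def corrected_allele_frequency(matches_reference, pattern_frequencies, tol=1e-9):
    """Evaluate P_i = sum_j m_ij * f_j for a single position i.

    matches_reference[j] is True when haplotype pattern j carries the
    reference allele at position i (m_ij = 1) and False otherwise (m_ij = 0).
    The linear form is an assumption inferred from claim 69's definitions.
    """
    if len(matches_reference) != len(pattern_frequencies):
        raise ValueError("one coefficient is needed per haplotype pattern")
    if any(f < 0.0 or f > 1.0 for f in pattern_frequencies):
        raise ValueError("each pattern frequency must lie in [0, 1]")
    if not math.isclose(sum(pattern_frequencies), 1.0, abs_tol=tol):
        raise ValueError("pattern frequencies must sum to 1")
    return sum(f for match, f in zip(matches_reference, pattern_frequencies) if match)
```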

72. (original) The method of claim 1, further comprising: 

calculating a difference between the first measure of relative allele frequency and the 
second measure of relative allele frequency. 

73. (original) The method of claim 72, wherein the interrogation position is a SNP 
position at position i within a haplotype block having N different haplotype patterns, and the
difference between the first measure and second measure of relative allele frequency is
corrected by applying a third equation:

ΔP_i = Σ_{j=1}^{N} m_ij Δf_j,

where ΔP_i is a corrected relative allele frequency difference of the interrogation position; N is
the total number of different haplotype patterns within the haplotype block; m_ij is a coefficient
having a value of +0.5 if the allele at position i matches a reference allele of the SNP and
having a value of -0.5 if the allele at position i matches an alternate allele of the SNP; and Δf_j
is a haplotype pattern frequency difference.

74. (original) The method of claim 73, wherein the third equation is constrained by a
fourth equation:

Σ_{j=1}^{N} Δf_j = 0.

75. (currently amended) The method of claim 1, wherein the step of inputting the first
measure includes inputting a plurality of first measures, and the step of inputting the second
measure includes inputting a plurality of second measures.

76. (currently amended) The method of claim 75, wherein the plurality of first measures
and the plurality of second measures correspond to duplicated or replicated measurements
of the same interrogation position in the nucleic acid segments.

77. (original) The method of claim 1, wherein said interrogation position is a plurality of 
interrogation positions in the same or different nucleic acid segments, and for each 
interrogation position a said first measure of allele frequency and a said second measure of 
allele frequency are inputted into said computer system and analyzed to characterize said each 
interrogation position.

78. (original) The method of claim 75, wherein the step of analyzing includes pairing a
measure from the plurality of first measures and a measure from the plurality of the second 
measures based on common experimental conditions. 

79. (original) The method of claim 75, wherein the step of analyzing includes analyzing 
using a set of differences of paired measures using a method selected from the group 
comprising: a paired t-test; calculating an Olympic average; determining the median value; 
and all members of the set having the same sign. 
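
The analyses listed in claim 79 can be sketched in Python over a set of paired differences, as below. This is a minimal illustration with hypothetical function names; the "Olympic average" is taken here to mean the mean after dropping the single highest and single lowest value, which is an assumption about the intended definition. The median is available directly as `statistics.median`.

```python
import math
import statistics


def paired_differences(first, second):
    """Differences of measures paired by common experimental conditions (claim 78)."""
    if len(first) != len(second):
        raise ValueError("paired measures must have equal length")
    return [a - b for a, b in zip(first, second)]


def olympic_average(values):
    """Mean after discarding the single lowest and single highest value."""
    if len(values) < 3:
        raise ValueError("need at least three values")
    return statistics.fmean(sorted(values)[1:-1])


def paired_t_statistic(differences):
    """t = mean(d) / (stdev(d) / sqrt(n)), with n - 1 degrees of freedom."""
    n = len(differences)
    if n < 2:
        raise ValueError("need at least two differences")
    spread = statistics.stdev(differences)
    if spread == 0:
        raise ValueError("differences have zero variance")
    return statistics.fmean(differences) / (spread / math.sqrt(n))


def all_same_sign(values):
    """True when every difference is strictly positive or every one is strictly negative."""
    if not values:
        return False
    return all(v > 0 for v in values) or all(v < 0 for v in values)
```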

80. (original) The method of claim 75, where the step of analyzing includes calculating 
the mean of the plurality of first measures and the mean of the plurality of second measures. 

81. (original) The method of claim 80, further comprising:

analyzing the absolute difference between the mean of the plurality of first measures 
and the mean of the plurality of second measures by thresholding the difference between the 
means, wherein if the absolute difference between the mean of the plurality of first measures 
and the mean of the plurality of second measures is equal to or above the threshold value, the 
interrogation position is characterized as being associated with the phenotypic characteristic 
of interest. 


82. (original) The method of claim 81, wherein the threshold value is 0.04. 

83. (original) The method of claim 81, wherein the threshold value is 0.1. 
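
The thresholding step of claims 80 to 83 can be sketched as follows. The function name `is_associated` is hypothetical; the default threshold of 0.04 is the value recited in claim 82, and 0.1 from claim 83 can be passed instead.

```python
import statistics


def is_associated(first_measures, second_measures, threshold=0.04):
    """Claims 80-81: compare the absolute difference of the two means to a threshold.

    Returns True when the absolute difference is equal to or above the threshold.
    """
    mean_first = statistics.fmean(first_measures)
    mean_second = statistics.fmean(second_measures)
    return abs(mean_first - mean_second) >= threshold
```

For example, case-group measures `[0.60, 0.62]` against control-group measures `[0.55, 0.57]` differ by about 0.05 in their means, which passes the 0.04 threshold but not the 0.1 threshold.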

84. (currently amended) The method of claim 81, further comprising: 

inputting another plurality of first measures and another plurality of
second measures;

calculating a mean of said another plurality of first measures and a mean of
said another plurality of second measures;

analyzing the difference between the mean of said another plurality of first
measures and the mean of said another plurality of second measures by using
another threshold value.

85. (currently amended) The method of claim 84, wherein the threshold value is 0.05 and
said another threshold value is 0.10, wherein if the absolute difference between the
mean of said another plurality of first measures and the mean of said
another plurality of second measures is equal to or above said another threshold
value, the interrogation position is characterized as being associated with the phenotypic
characteristic of interest.

86. (original) The method of claim 75, wherein the step of analyzing includes: 
calculating the standard deviation of the plurality of first measures and the standard 

deviation of the plurality of second measures; and 

analyzing each of the standard deviations of the first and second pluralities of 
measures. 

87. (original) The method of claim 86, wherein each of the standard deviations of the first 
and second pluralities of measures is analyzed by using a chi-squared distribution to 
determine at least one cutoff value for the standard deviations. 

88. (original) The method of claim 87, further comprising: 

discarding the plurality of the first or second measures if the calculated standard 
deviation of the plurality of the first or second measures is greater than the cutoff value. 

89. (original) The method of claim 75, further comprising: 

calculating an arithmetic mean and standard deviation of the plurality of the first 
measures, and an arithmetic mean and standard deviation of the plurality of second measures, 
and 

applying a t-test using said arithmetic means and standard deviations to determine 
whether it is likely that the plurality of first measures and the plurality of second measures are
from the same or different distributions, where if the plurality of first measures and the
plurality of second measures are found likely to be from different distributions, the 
interrogation position in the nucleic acid segment is characterized as being associated with the 
phenotypic characteristic of interest. 

90. (original) The method of claim 75, further comprising: 

calculating an arithmetic mean and standard deviation for the plurality of the first 
measures, and an arithmetic mean and standard deviation for the plurality of second measures, 

determining a distribution for the plurality of first measures based on said arithmetic 
mean and standard deviation of the plurality of first measures, 

determining a distribution for the plurality of second measures based on said 
arithmetic mean and standard deviation of the plurality of second measures, and 

determining a difference between said distribution for the plurality of first measures 
and said distribution for the plurality of second measures, wherein if the difference between 
said distribution of the plurality of first measures and said distribution of the plurality of 
second measures is significant, either positive or negative, the interrogation position in the 
nucleic acid segment is characterized as being associated with the phenotypic characteristic of 
interest. 

91. (original) The method of claim 89, wherein the likelihood that the plurality of first
measures and the plurality of second measures are from different distributions is assessed by
using a formula of the form:

t = (<P'_1> - <P'_2>) / [denominator illegible in source],

where t is the t-statistic in the t-test analysis; <P'_1> and <P'_2> are arithmetic means of the
pluralities of first and second measures, respectively; σ_1 and σ_2 are standard deviations of
the pluralities of first and second measures, respectively; and N_1 and N_2 are numbers of
members of the pluralities of first and second measures, respectively.

92. (currently amended) The method of claim 91, further comprising:

comparing the t-statistic with a Student t-distribution using a number of degrees of
freedom, which is calculated by using an equation of the form (N_1 + N_2 - 2), to obtain a
p-value, where if the p-value is lower than 0.05, the interrogation position in the nucleic acid
segment is characterized as being associated with the phenotypic characteristic of interest.
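
A two-sample t-statistic of the kind described in claims 89 to 92 can be sketched as below. The exact denominator of the claimed formula is not legible in this listing, so the pooled-variance (Student) form is used here as an assumption, chosen because it pairs with the N_1 + N_2 - 2 degrees of freedom recited in claim 92. The function name is hypothetical, and the p-value lookup against a Student t-distribution is left to a statistics library.

```python
import math
import statistics


def pooled_t_statistic(first_measures, second_measures):
    """Return (t, degrees_of_freedom) for a pooled-variance two-sample t-test.

    The pooled-variance denominator is an assumption of this sketch.
    """
    n1, n2 = len(first_measures), len(second_measures)
    if n1 < 2 or n2 < 2:
        raise ValueError("each group needs at least two measures")
    var1 = statistics.variance(first_measures)
    var2 = statistics.variance(second_measures)
    dof = n1 + n2 - 2
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / dof
    if pooled_var == 0:
        raise ValueError("both groups have zero variance")
    mean_diff = statistics.fmean(first_measures) - statistics.fmean(second_measures)
    t = mean_diff / math.sqrt(pooled_var * (1.0 / n1 + 1.0 / n2))
    return t, dof
```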

93. (currently amended) The method of claim 92, further comprising: 

repeating the same steps for characterization of the interrogation position in the 
nucleic acid segment on one or more additional interrogation positions in the same or
different nucleic acid segment; 

discarding the interrogation positions having p-values higher than a cutoff value, and 
characterizing the remaining interrogation position(s) as being associated with the phenotypic 
characteristic of interest. 

94. (original) The method of claim 75, further comprising: 

calculating absolute values of the plurality of the first measures and calculating 
absolute values of the plurality of the second measures; 

ranking the absolute values of the plurality of the first measures; ranking the absolute 
values of the plurality of the second measures; and 

determining the rank-sum distribution for both the plurality of the first measures and 
the plurality of the second measures. 

95. (original) The method of claim 94, wherein the rank-sum distribution is determined by 
using a Wilcoxon rank-sum test.
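
A rank-sum analysis along the lines of claims 94 and 95 can be sketched in Python as below. Several choices here are assumptions rather than claim language: the absolute values of both pluralities are pooled before ranking, tied values receive average ranks, and the two-sided p-value uses the normal approximation without a tie correction. The function names are hypothetical.

```python
import math
from statistics import NormalDist


def average_ranks(values):
    """1-based ranks in ascending order, with ties given their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = shared_rank
        pos = end + 1
    return ranks


def rank_sum_test(first_measures, second_measures):
    """Return (rank sum of the first group, two-sided normal-approximation p-value)."""
    n1, n2 = len(first_measures), len(second_measures)
    if n1 == 0 or n2 == 0:
        raise ValueError("both groups need at least one measure")
    pooled = [abs(v) for v in first_measures] + [abs(v) for v in second_measures]
    ranks = average_ranks(pooled)
    w = sum(ranks[:n1])
    expected = n1 * (n1 + n2 + 1) / 2
    spread = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (w - expected) / spread
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return w, p_value
```

The normal approximation is rough for very small groups; an exact table or a statistics library would be preferable there.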

96. (original) The method of claim 95, further comprising: 

determining the p-value for the interrogation position, wherein a confidence level 
higher than a selected value indicates that the interrogation position is a SNP position 
associated with the phenotypic characteristic of interest. 

97. (original) The method of claim 95, further comprising:

determining the p-value for the interrogation position, wherein a confidence level
higher than 99% indicates that the interrogation position is a SNP position associated with the 
phenotypic characteristic of interest. 

98. (currently amended) The method of claim 1, wherein said interrogation position is
at least two different interrogation positions and the step of inputting the first measure 
includes i) inputting a plurality of first measures corresponding to the different interrogation 
positions, and the step of inputting the second measure includes ii) inputting a plurality of 
second measures corresponding to the same interrogation positions as in step i). 

99. (currently amended) The method of claim 98, further comprising: 

iii) repeating step i) to obtain another plurality of first measures; and

iv) repeating step ii) to obtain another plurality of second measures.

100. (currently amended) The method of claim 99, further comprising: 

calculating the difference between the plurality of the first measures and the plurality 
of the second measures for each of the different interrogation positions; 

identifying the interrogation positions having a difference above a first threshold 
value, and the interrogation positions having a difference below a second threshold value 
between the plurality of the first measures and the plurality of the second measures; 

calculating the difference between said another plurality of the first
measures and said another plurality of the second measures for each of the
different interrogation positions;

identifying the interrogation positions having a difference above the first threshold
value, and the interrogation positions having a difference below the second threshold
value between said another plurality of the first measures and said
another plurality of the second measures,

characterizing those interrogation positions that are identified as having a difference
above the first threshold value in both the plurality and said another plurality of
measures as being associated with the phenotypic characteristic of interest, and

characterizing those interrogation positions that are identified as having a difference
below the second threshold value in both the plurality and said another plurality of
measures as being associated with the phenotypic characteristic of interest.

101. (original) The method of claim 100, wherein the first threshold value separates the top 
20% in the distribution of the calculated differences from the bottom 80%, and the second
threshold value separates the bottom 20% in the distribution of the calculated differences from
the top 80%, wherein those interrogation positions that have a difference in the top 20% or the
bottom 20% of the distribution of calculated differences are characterized as associated. 

102. (original) The method of claim 100, wherein the first threshold value separates the top 
5% in the distribution of the calculated differences from the bottom 95%, and the second 
threshold value separates the bottom 5% in the distribution of the calculated differences from 
the top 95%, wherein those interrogation positions that have a difference in the top 5% or the 
bottom 5% of the distribution of calculated differences are characterized as associated. 

103. (currently amended) The method of claim 1, further comprising:
validating the characterization of the interrogation position. 

104. (original) The method of claim 103, wherein the step of validating includes identifying 
a location of the nucleic acid segment in the human genome. 

105. (original) The method of claim 104, wherein if the nucleic acid segment is located in a 
coding or regulatory region of a gene and the interrogation position was designated as 
associated with the phenotypic characteristic of interest, then the gene is deemed to be 
associated with the phenotypic characteristic of interest and the step of validating further 
includes cloning and expressing the associated gene to produce a protein product and 
characterizing the protein product. 

106. (original) The method of claim 104, wherein if the nucleic acid segment is located in a 
coding or regulatory region of a gene, and the interrogation position was designated as 
associated with the phenotypic characteristic of interest, then the gene is deemed to be 
associated with the phenotypic characteristic of interest and the step of validating further 
includes regulating expression of the associated gene in cells in vitro or in vivo and detecting 
changes of cells in response to the regulation. 

107. (original) The method of claim 104, wherein if the nucleic acid segment is located in a 
coding or regulatory region of a gene and the interrogation position was designated as 
associated with the phenotypic characteristic of interest, then the gene is deemed to be 
associated with the phenotypic characteristic of interest and the step of validating further 
includes: 

screening a library of pharmaceutical candidates against the associated gene or gene 
product; and 

selecting the pharmaceutical candidates that modulate expression of the associated 
gene or activity of the associated gene product. 

108. (currently amended) A method for determining a relative allele frequency for an 
interrogation position in a pool of nucleic acid segments based on nucleotide variation at the 
interrogation position in the nucleic acid segments relative to a reference nucleotide 
sequence, wherein a nucleic acid segment that is identical to the reference nucleotide 
sequence at the interrogation position is designated as a reference nucleic acid segment and a 
nucleic acid segment with nucleotide variation relative to the reference nucleotide sequence 
at the interrogation position is designated as an alternate nucleic acid segment, and further 
wherein the interrogation position contains a biallelic polymorphism, comprising: 

determining a plurality of intensities of signals from reference nucleic acid segments 
in the pool of nucleic acid segments, wherein the plurality of intensities of signals from the 
reference nucleic acid segments are designated as I_{R,1-i}, wherein i is an integer equal to or 
larger than 2; 

determining a plurality of intensities of signals from alternate nucleic acid segments 
in the pool of nucleic acid segments, wherein the plurality of intensities of signals from the 
alternate nucleic acid segments are designated as I_{A,1-j}, wherein j is an integer equal to or 
larger than 2; 

determining the relative allele frequency of the interrogation position by calculating 
an equation of the form: 

<I_{R,1-i}> / (<I_{A,1-j}> + <I_{R,1-i}>) or 
<I_{A,1-j}> / (<I_{A,1-j}> + <I_{R,1-i}>), 

where <I_{R,1-i}> is the average of the plurality of intensities of signals from the reference 
nucleic acid segments, and <I_{A,1-j}> is the average of the plurality of intensities of signals 
from the alternate nucleic acid segments. 
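
As a minimal sketch of the ratio recited in claim 108, assuming the average is an arithmetic mean (one of the options named in the dependent claims), the calculation could look like the following. The function name `relative_allele_frequency`, its `allele` argument, and the sample intensities are hypothetical.

```python
from statistics import mean


def relative_allele_frequency(ref_intensities, alt_intensities, allele="reference"):
    """Compute <I_R> / (<I_A> + <I_R>) or <I_A> / (<I_A> + <I_R>).

    Hypothetical helper: each argument is a sequence of at least two
    signal intensities, and the average is taken as an arithmetic mean.
    """
    if len(ref_intensities) < 2 or len(alt_intensities) < 2:
        raise ValueError("each plurality needs at least two intensities")
    avg_ref = mean(ref_intensities)
    avg_alt = mean(alt_intensities)
    total = avg_ref + avg_alt
    if total == 0:
        raise ValueError("sum of average intensities is zero")
    if allele == "reference":
        return avg_ref / total
    return avg_alt / total


# Illustrative intensities only: averages of 400 (reference) and 100 (alternate).
ref_freq = relative_allele_frequency([300.0, 500.0], [100.0, 100.0])
alt_freq = relative_allele_frequency([300.0, 500.0], [100.0, 100.0], allele="alternate")
```

Because the interrogation position is biallelic, the two forms of the ratio sum to one, so either allele can serve as the reported frequency.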

109. (original) The method of claim 108, wherein the signals result at least from perfect 
match probes on an oligonucleotide array. 

110. (currently amended) The method of claim 108, wherein the average of the plurality of 
intensities of signals is an arithmetic mean of the plurality of intensities of signals. 

111. (currently amended) The method of claim 108, wherein the average of the plurality of 
intensities of signals is a trimmed mean of the plurality of intensities of signals. 
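
The difference between the arithmetic mean of claim 110 and the trimmed mean of claim 111 can be illustrated with a short Python sketch. The claims do not state how much is trimmed, so the `trim_count` parameter (number of values dropped from each end) and the function name `trimmed_mean` are assumptions made for this example.

```python
from statistics import mean


def trimmed_mean(intensities, trim_count=1):
    """Average the intensities after dropping `trim_count` values from
    each end of the sorted list (hypothetical trimming rule)."""
    if trim_count < 0:
        raise ValueError("trim_count must be non-negative")
    ordered = sorted(intensities)
    kept = ordered[trim_count:len(ordered) - trim_count]
    if not kept:
        raise ValueError("trim_count removes every intensity")
    return mean(kept)


# Illustrative signals with one outlier.
signals = [1.0, 2.0, 3.0, 4.0, 100.0]
arithmetic = mean(signals)                    # 22.0
trimmed = trimmed_mean(signals, trim_count=1)  # 3.0
```

The trimmed form is less sensitive to a single aberrant signal, which is presumably why it is offered as an alternative to the arithmetic mean.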

112. (original) The method of claim 108, wherein the average of the plurality of intensities 
of signals from the reference nucleic acid segment, <I_{R,1-i}>, excludes outlier signals; and the 
average of the plurality of intensities of signals from the alternate nucleic acid segment, 
<I_{A,1-j}>, excludes outlier signals. 

113. (original) The method of claim 108, wherein at least one of <I_{R,1-i}> or <I_{A,1-j}> is 
corrected. 

114. (original) The method of claim 113, wherein at least one of <I_{R,1-i}> or <I_{A,1-j}> is 
corrected for background. 

115. (original) The method of claim 114, wherein at least one of <I_{R,1-i}> and <I_{A,1-j}> are 
corrected for background and the relative allele frequency of an interrogation position is 
determined by calculating an equation of the form: 

«/r>- </r>+</r> ) 



^ T mm j j mm ^ T mm >^_i_ T mm 

(</r> - <Z* >+ <i* > )+(</r> _ </^ >+</a > ) 



wherein <I_R^{pm}> is the average of the plurality of intensities of signals from the reference 
nucleic acid segment generated by perfect match probes on an oligonucleotide array; <I_A^{pm}> 
is the average of the plurality of intensities of signals from the alternate nucleic acid segment 
generated by perfect match probes on the oligonucleotide array; <I_R^{mm}> is the average of the 
plurality of intensities of signals from the reference nucleic acid segment generated by 
mismatch probes on the oligonucleotide array; and <I_A^{mm}> is the average of the plurality of 
intensities of signals from the alternate nucleic acid segment generated by mismatch probes 
on the oligonucleotide array. 



116. (withdrawn) The method of claim 115, wherein the average of intensities of signals is 
an arithmetic mean of intensities of signals. 

117. (original) The method of claim 115, wherein the average of intensities of signals is a 
trimmed mean of intensities of signals. 

118. (currently amended) A computer-implemented method for characterizing a 
polymorphic marker in a nucleic acid as being associated with a phenotypic trait of interest, 
comprising: 

inputting into a computer system a first measure of relative allele frequency for the 
polymorphic marker in a first sample, said first sample containing nucleic acids from a first 
group of n individuals, wherein n is an integer equal to or larger than 2; 

inputting into a computer system a second measure of relative allele frequency for the 
polymorphic marker in a second sample, said second sample containing nucleic acids from a 
second group of m individuals, wherein m is an integer equal to or larger than 2; and 

analyzing in the computer system the first measure and the second measure to 
characterize the polymorphic marker; and 

outputting a result of said analyzing, wherein the result characterizes the interrogation 
position as being associated with the phenotypic trait of interest. 

119. (withdrawn) A computer-implemented method for characterizing an interrogation 
position in a nucleic acid segment, comprising: 

inputting into a computer system a group of first measures of hybridization probe 
intensities corresponding to the interrogation position in the nucleic acid segment derived 
from a first sample collected from a case group of n individuals, wherein n is an integer equal 
to or larger than 2; 

calculating in the computer system a group of first relative allele frequencies at the 
interrogation position for the case group based on the group of first measures of hybridization 
probe intensities; 

inputting into the computer system a group of second measures of hybridization probe 
intensities corresponding to the interrogation position in the nucleic acid segment derived 
from a second sample collected from a control group of m individuals, wherein m is an 
integer equal to or larger than 2; 

calculating in the computer system a group of second relative allele frequencies at the 
interrogation position for the control group based on the group of second measures of 
hybridization probe intensities; 

analyzing in the computer system the group of the first relative allele frequencies and 
the group of second relative allele frequencies to characterize the interrogation position as 
being associated with a phenotypic characteristic of interest. 

120. (withdrawn) The method of claim 119, wherein the group of the first measures of 
hybridization probe intensities are obtained by at least 2 sets of repetitive experiments, and the 
group of the second measures of hybridization probe intensities are obtained by at least 2 sets 
of repetitive experiments. 

121. (withdrawn) The method of claim 119, wherein the step of analyzing includes: 

calculating a mean of the group of the first relative allele frequencies; 
calculating a mean of the group of the second relative allele frequencies; and 
calculating the absolute difference between the mean of the group of the first relative 
allele frequencies and the mean of the group of the second relative allele frequencies, 
where if the absolute difference is equal to or above a predetermined threshold value, the 
interrogation position is characterized as being associated with the phenotypic characteristic 
of interest. 

122. (withdrawn) The method of claim 121, wherein the threshold value ranges from 0.02 
to 0.2. 

123. (withdrawn) The method of claim 121, wherein the threshold value ranges from 0.04 
to 0.1. 
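
The analysis of claims 121 to 123 reduces to comparing an absolute difference of means against a threshold. A minimal Python sketch under stated assumptions follows: the function name `characterize_by_mean_difference` is hypothetical, and the default threshold of 0.04 is simply one value inside both claimed ranges, not a value prescribed by the claims.

```python
from statistics import mean


def characterize_by_mean_difference(first_freqs, second_freqs, threshold=0.04):
    """Return (absolute difference of group means, associated flag).

    Hypothetical helper: the position is flagged when the absolute
    difference is equal to or above the threshold.
    """
    diff = abs(mean(first_freqs) - mean(second_freqs))
    return diff, diff >= threshold


# Illustrative replicate frequency estimates for the case and control groups.
diff_a, flag_a = characterize_by_mean_difference([0.60, 0.62], [0.50, 0.52])
diff_b, flag_b = characterize_by_mean_difference([0.50, 0.52], [0.51, 0.51])
```

In the first call the group means differ by about 0.10, so the position is flagged; in the second the means coincide and it is not.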

124. (withdrawn) The method of claim 119, wherein the step of analyzing includes 
applying a statistical test to the group of the first relative allele frequencies and the group of 
second relative allele frequencies to characterize the interrogation position as being associated 
with a phenotypic characteristic of interest. 

125. (withdrawn) The method of claim 124, wherein the statistical test is a t-test or rank- 
sum test. 
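
Claim 125 names a t-test without specifying a variant. As one possible reading, the sketch below computes the test statistic and degrees of freedom of Welch's unequal-variance t-test using only the Python standard library. The choice of Welch's form and the function name `welch_t_statistic` are assumptions; converting the statistic to a p-value would require a t-distribution table or a statistics library and is omitted.

```python
from math import sqrt
from statistics import mean, variance


def welch_t_statistic(first_freqs, second_freqs):
    """Return (t, degrees of freedom) for Welch's two-sample t-test.

    Hypothetical helper: each argument is a group of relative allele
    frequency estimates containing at least two values.
    """
    n1, n2 = len(first_freqs), len(second_freqs)
    if n1 < 2 or n2 < 2:
        raise ValueError("each group needs at least two values")
    a = variance(first_freqs) / n1    # squared standard error, group 1
    b = variance(second_freqs) / n2   # squared standard error, group 2
    if a + b == 0:
        raise ValueError("both groups have zero variance")
    t = (mean(first_freqs) - mean(second_freqs)) / sqrt(a + b)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (a + b) ** 2 / (a ** 2 / (n1 - 1) + b ** 2 / (n2 - 1))
    return t, df


# Illustrative replicate estimates only.
t_stat, dof = welch_t_statistic([0.60, 0.62, 0.61], [0.50, 0.52, 0.51])
```

A large absolute value of the statistic relative to the t distribution with the computed degrees of freedom would support characterizing the position as associated.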

126. (withdrawn) The method of claim 119 wherein said an interrogation position is at least 
two interrogation positions, and further comprising: 

calculating the difference between the group of the first measures and the group of the 
second measures for each of the different interrogation positions; 

characterizing the interrogation positions having a difference above a first threshold 
value, and interrogation positions having a difference below a second threshold value between 
the group of the first measures and the group of the second measures as associated with the 
phenotypic characteristic of interest. 

127. (withdrawn) A computer-implemented method for characterizing an interrogation 
position in a nucleic acid segment with a previously known location in the human genome, 
comprising: 

inputting into a computer system a first measure of relative allele frequency at the 
interrogation position in the nucleic acid segment derived from a first sample collected from a 
first group of n individuals, wherein n is an integer equal to or larger than 2; 

inputting into the computer system a second measure of relative allele frequency at the 
interrogation position in the nucleic acid segment derived from a second sample collected 
from a second group of m individuals, wherein m is an integer equal to or larger than 2; and 

analyzing in the computer system the first measure and the second measure to 
characterize the interrogation position. 

128. (withdrawn) The method of claim 127, wherein analyzing includes analyzing in the 
computer system the first measure and the second measure to characterize the interrogation 
position as being associated with a phenotypic characteristic of interest. 

129. (withdrawn) The method of claim 128, wherein the nucleotide segment is proximal to 
or within a region of a candidate gene that is not previously known to be associated with the 
phenotypic characteristic of interest. 

130. (withdrawn) The method of claim 128, wherein the nucleotide segment is proximal to 
or within a region of a candidate gene that is previously suspected of being associated with the 
phenotypic characteristic of interest. 

131. (withdrawn) The method of claim 128, wherein the nucleotide segment is proximal to 
or within an untranslated region of a candidate gene that is not previously known to be 
associated with the phenotypic characteristic of interest. 

132. (withdrawn) The method of claim 128, wherein the nucleotide segment is proximal to 
or within an untranslated region of a candidate gene that is previously suspected of being 
associated with the phenotypic characteristic of interest. 

133. (currently amended) Data processing apparatus for characterizing an interrogation 
position in a nucleic acid segment, wherein the interrogation position contains a biallelic 
polymorphism, comprising: 

a data processor; and 

a storage device holding computer readable code in communication with the data 
processor, the computer readable code including: 

computer code which determines a first measure of relative allele frequency at 
the interrogation position in the nucleic acid segment derived from a first sample 
collected from a case group of n individuals, wherein n is an integer equal to or 
larger than 2, and wherein the individuals in the case group are selected based on a 
phenotypic characteristic of interest; 

computer code which determines a second measure of relative allele frequency 
at the interrogation position in the nucleic acid segment derived from a second sample 
collected from a control group of m individuals, wherein m is an integer equal 
to or larger than 2, and wherein the individuals in the control group are selected based 
on lack of the phenotypic characteristic of interest; and 

computer code which analyzes the first measure and the second measure to 
characterize the interrogation position as being associated with the phenotypic 
characteristic of interest. 

134. (canceled) 

135. (currently amended) The data processing apparatus of claim 133[[4]], wherein the data 
processing apparatus is further in communication with another data storage device which 
stores a first measure of the intensity of signals from a first probe set on an oligonucleotide 
array and a second measure of the intensity of signals from a second probe set on the 
oligonucleotide array, wherein the first measure of relative allele frequency is determined 
based on the first measure of intensity and the second measure of relative allele frequency is 
determined based on the second measure of intensity. 

136. (original) The data processing apparatus of claim 135, wherein the data processing 
apparatus is further in communication with an imaging device. 

137. (original) The data processing apparatus of claim 136, wherein the imaging device is a 
scanner that determines the first measure of intensity and the second measure of intensity. 

138. (currently amended) A computer program product comprising a machine readable 
medium holding computer readable code for characterizing an interrogation position in a 
nucleic acid segment, wherein the interrogation position contains a biallelic polymorphism, 
comprising: 

code for determining a first measure of relative allele frequency at the interrogation 
position in the nucleic acid segment derived from a first sample collected from a case 
group of n individuals, wherein n is an integer equal to or larger than 2, and wherein the 
individuals in the case group are selected based on a phenotypic characteristic of interest; 

code for determining a second measure of relative allele frequency at the interrogation 
position in the nucleic acid segment derived from a second sample collected from a 
control group of m individuals, wherein m is an integer equal to or larger than 2, and wherein 
the individuals in the control group are selected based on lack of the phenotypic characteristic 
of interest; and 

code for analyzing the first measure and the second measure to characterize the 
interrogation position as being associated with a phenotypic characteristic of interest. 

139. (canceled) 